Exploration in the Dark: Reasoning about Planning Strategies
Abstract
We present a new approach to reasoning about planning strategies in multiagent domains: an agent learns a boundedly-optimal planning strategy, relative to the current state of the environment and its goals, using a myopic attitude based on its repertoire of planning policies. In self-play, our approach also allows boundedly-optimal coordination. In terms of the exploration-exploitation tradeoff, the agent need not generate a model of its environment (nor of other domain agents) as part of its exploratory activities, which allows efficient exploration as well as tractability in real-world scenarios. As a complement to this approach, we present the Leap-and-Stride strategy, a novel probabilistic strategy that serves agents in situations where they can make no assumptions about their surroundings, either because they lack a priori knowledge about the environment or because its complex and dynamic nature renders it inherently unpredictable. The Leap-and-Stride strategy interleaves exploratory activities into the agent’s planning logic, which enables it to converge to effective modes of interaction with its environment through trial and error. The strategy is tunable: the agent can control the intensity of its exploratory activities according to a risk management policy. We present theoretical results regarding the complexity of Leap-and-Stride, and demonstrate its viability through two sets of empirical studies.
Similar resources
Different Task Complexity Factors and Cognitive Individual Differences: The Effects on EFL Writers’ Performance
This study aimed at examining the main and interaction effects of increased intentional reasoning demands, planning time, and also language learning aptitude on syntactic complexity, accuracy, lexical complexity, and fluency (CALF) of 226 EFL learners’ performance on letter writing tasks. The participants were first randomly assigned to three experimental groups to be given a task with differin...
Reasoning in complex environments with the SelectScript declarative language
SelectScript is an extendable, adaptable, and declarative domain-specific language aimed at information retrieval from simulation environments and robotic world models in an SQL-like manner. In this work we have extended the language in two directions. First, we have implemented hierarchical queries; second, we improve efficiency enabling manual design space exploration on different “search” str...
Towards a Sustainable Anti-Corruption Strategy: An Ethic-Induced Model
Literature abounds to show that the current anti-corruption strategies have failed to fight corruption because of neglect of ethics in these strategies, despite its importance. The purpose of this paper is to make a contribution to anti-corruption theory by developing a model that clarifies many complex ethical dilemmas around corruption. To develop a conceptual model, the extant literatures on...
Exploration of Factors Promoting and Inhibiting Fast Food Consumption among Adolescents
Introduction: In recent years, fast food consumption has increased among adolescents and it has become a concern, a health threat, and a major health problem. There are few studies and evidences about factors promoting and inhibiting the consumption of fast food. This study aimed to identify factors promoting or inhibiting the consumption of fast food among adolescents. Method: This qualitative...
Moral Reasoning Ability of Nursing Students at Shahid Sadoughi University of Medical Sciences, Yazd
Abstract Background & Aim: The setting of nursing care provision is full of ethical dilemmas, which require moral reasoning ability. This study was designed to evaluate the moral reasoning ability of nursing students of Shahid Sadoughi University of Medical Sciences in Yazd city. Material & Methods: It was a descriptive correlational study that was conducted in 2012. ...
Publication date: 2008